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Abstract. We study the role of strategy correlations and timing of adaptation for the dynamics of Minority 
Games, both simulationally and analytically. Using the exact generating functional approach a la De 
Dominicis we compute the phase diagram and the behaviour of batch and on-line games with correlated 
strategies, complementing exisiting replica studies of their statics. It is shown that the timing of adaptation 
can be relevant; while conventional games with uncorrelated strategies are nearly insensitive to the choice 
of on-line versus batch learning, we find qualitative differences when anti-correlations are present in the 
strategy assignments. The available standard approximations for the volatility in terms of persistent order 
parameters in the stationary ergodic states become unreliable in batch games under such circumstances. 
We then comment on the role of oscillations and the relation to the breakdown of ergodicity. Finally, it 
is discussed how the generating functional formalism can be used to study mixed populations of so-called 
'producers' and 'speculators' in the context of the batch minority games. 
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1 Introduction 

The collective behaviour of interacting heterogeneous and 
adaptive agents has attracted a substantial amount of at- 
tention in the statistical physics communtity in recent 
years. The general aim is to understand how complex 
macroscopic co-operation can emerge from the underlying 
microscopic laws that govern the behaviour of the individ- 
ual agents. The minority game (MG) is probably one 
of the most studied models in this context. It describes an 
ensemble of traders who at each time step receive a piece 
of public information and react by either buying or selling. 
Learning from past experience, the aim of the individual 
agent is to be in the minority at each round of the game, 
i.e. to buy when most of the traders are selling and vice 
', versa. To take their trading decisions the agents employ 
strategies assigned randomly at the start of the game and 
then kept fixed. These effectively act as look-up tables pro- 
viding a map from the observed public information onto 
a binary trading decision. At each time step the individ- 
ual agent employs the strategy in his or her pool he or 
she believes will maximise their potential payoff. In order 
to measure the performance of their strategies they allo- 
cate virtual points to each of their strategies, increasing 
the score of strategies that would have yielded a correct 
minority decision. 

Thanks to the simplicity of the microscopic equations 
of motion and because of analogies to models of spin- 
glasses and neural networks, analytical progress can be 
made using both static and dynamical methods of sta- 



tistical mechanics. It has been found that the relevant 
control parameter for MGs is the ratio a = P/N of the 
number P of possible values for the external information 
over the number of players N. A phase transition between 
an ergodic phase at high a and non-ergodic states be- 
low a critical value a c has been identified and analytical 
expressions for the macroscopic order parameters charac- 
terising the different states have been obtained PH^|f4"||^)| 
ElEj. While some order parameters in the ergodic phase 
can be computed in the thermodynamic limit N — > oo 
without making any approximations, the dynamics of the 
game in its non-ergodic phase at low values of a is not yet 
fully understood. Furthermore the calculation of a key ob- 
servable of the MG, the magnitude of the overall market 
fluctuations or so-called volatility, is up to now restriced 
to approximations both in dynamical approaches and in 
static replica analyses [HIE] • 

In the original setup of the MG the strategies assigned 
to a given agent were completely uncorrelated and the 
information fed to the agents was based on the real his- 
tory of the market dynamics 0|Hj. Also, updates of the 
virtual scores of the agents' strategies were performed 
after each round of the game, and their relative success 
re-assessed before each trading decision, i.e. agents could 
switch strategies between any two successive rounds of 
the game. Such variants of the MG with score updates at 
every round of the game are generally referred to as 'on- 
line' games. It was realised that no qualitative changes 
in the overall behaviour occured when the stream of in- 
formation was replaced by random inputs at each time 
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step j|. Hereafter we shall therefore distinguish between 
on-line games with 'real' or 'random' market history, re- 
spectively. It is technically much easier to deal with the 
case of 'random' history so henceforth we concentrate on 
this version (see however the recent work ^U] for a gener- 
ating functional analysis of MGs with real market histo- 
ries) . Early studies showed only slight differences between 
on-line MGs with uncorrelated strategies and correspond- 
ing games in which the agents effectively sample a large 
set of information patterns before updating the scores of 
their strategies. This amounts to replacing the inflowing 
information by an averaging procedure over all possible 
values of the pieces of information to yield an effective 
interaction between the agents; such models are known 
as 'batch' minority games. On the technical level, using 
this effective interaction, the dynamics of batch MGs are 
much easier to study than their on-line counterparts, and 
hence most current dynamical studies are concerned with 
batch versions of the MG [rjlllllll2lll3lll4lll5| . Also, here 
we restrict ourselves to the case of deterministic decision 
making; a stochastic extension is straightforward to con- 
sider , but complicates the detailed mathematics of the 
analysis. Reviews on the MG can be found in |17U18j : note 
also the recently published textbooks ^HIGHI- 

In this paper we extend the generating functional anal- 
ysis of batch and on-line games with uncorrelated strate- 
gies as presented in |6U21| to the case of correlated and 
anti-correlated strategies. The purpose is twofold, firstly 
we provide an analysis of the dynamics of such MGs with 
so-called 'diversified' strategies and complement the study 
of their statics previously presented by other authors in 
2\ . Secondly, we find that in the presence of anti-correlated 
strategies significant differences in the global behaviour of 
batch and on-line MGs are present, i.e. that the the overall 
performance of the system strongly depends on the timing 
of adapation of the agents. This effect is small in the case 
of conventional MGs with uncorrelated strategies, but be- 
comes magnified as the degree of anti-correlation in the 
strategy assignments is increased. We also demonstrate 
that some approximations which reproduce the the volatil- 
ity of batch MGs with uncorrelated strategies with good 
accuracy, and are hence often considered to be standard, 
become inaccurate in games with highly anti-correlated 
stategies. We then discuss the effects of strategy corre- 
lations on the sensitivity to initial conditions and com- 
ment on the relation between the global oscillations and 
the breakdown of ergodicity in batch and on-line MG with 
different strategy correlations. As a by-product of the gen- 
erating functional analysis of the dynamics of games with 
general distributions of correlation parameters we finally 
study the case of bi-modal distributions, corresponding to 
a mixed population of so-called 'speculators' and 'produc- 
ers' 



2 Model Definitions 

The MG describes the decision making dynamics of N 
interacting agents, i — 1, . . . , N. At each round £ of the 
game all agents are given the same external information 



I(£) taken from a set of P elements. They utilise this in- 
formation to determine an action by choosing from one of 
a personal set of S strategies whose operation on the in- 
formation yields a decision. Simulations have shown that 
restriction to S — 2 strategies per agent yields characteris- 
tic behaviour. It also simplifies the analysis. Hence we re- 
strict the further discussion here to this case. We also sim- 
plify the choices of the space of information and character 
of the decisions. Hence here we take each agent to hold 
two P-dimensional strategy vectors Ri a = (Rj a , ■ ■ ■ , Rfa), 
a = ±1, with each component R? chosen randomly from 
the set {—1,1} at the beginning of the game and there- 
after fixed and the information to be given by an integer 
fi(£) chosen randomly and independently at each step £ 
from the set {1, ...,P — aN}. Then the strategies act 
as look up tables for deciding 'trading' action, yielding 

output R^ • If player i decides to use strategy a*(£) at 
round £ of the game his or her trading action will be 
bi(t) = Rfa*m- The re-scaled total bid at stage I is then 

defined as A(£) — A -1 / 2 bi(£). In order to decide which 
of their two strategies to use at any given time each play- 
ers keeps a record of the relative performance of his or 
her strategies. A payoff value Pi a {£) is assigned to each 
strategy and updated every M time steps according to 

t+M-l 

Pia (l + M)=p ia (£)-^= £ RtpA{£'). (1) 

In between the updates the scores pi a are kept constant. 
r > is a learning rate and of order 0(N ), introduced 
for convenience Note that the minus sign in the up- 
date prescription ensures that strategies which predict 
correct minority decisions are rewarded. At each round 
player i then plays the strategy in his or her arsenal with 
the highest relative score, i.e. a*(£) = sgn[(fc(£)], where 
qi (£) — p i _ 1 (£)) is the point difference of player 

i's strategies at time £. Upon introduction of oji = [R% t i + 
Ri ,-i)/2 and & = - R4 ,_i)/2 the update rule (JTJ 

can be compactificd to 



i+M-l 



qi {£ + M)= qi {l)- 




xU-y^^f >+ef '),.(£'))}. (2) 

j 

Here we have introduced the shorthand notation Si (£) — 
sgn[qi(£)], we will also abbreviate fl = N~ 1/>2 £3 . <^j- The 
case M = 1 (where strategy scores are updated at ev- 
ery time step) is referred to as the on-line model. For 
M > 0(aN) one expects (and simulations confirm) the 
behaviour of the system to be the same as that for the 
so-called batch version in which the sum over the ac- 
tual values /i(^') is replaced by an average over the set 
fie {I,.. .,aN}: 

aN 

qi ( t + 1) = qi (t) er { ]r k + $ aj (t)) }. (3) 

M =1 3 
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Note that an appropriate re-scaling of time is implied, 
as one batch time step corresponds to 0(aN) on-line 
steps. Introducing the notation = ■ £j/N and 

hi = -T^ • n/y/N the batch update rule can be writ- 
ten compactly as 

ft (t + 1) = qi (*) - hi - J2 J H s i (*) • ( 4 ) 

3 

In the original game the strategies are chosen without 
correlations between the agents and also independently 
within the set of strategies of a fixed agent i, i.e. one has 

i*P^ = <%<WW ( 5 ) 

where 7TT denotes an average over the disorder, i.e. over the 
space of strategy assignments. In this paper, we will con- 
sider cases of correlated strategy vectors Ri-\ and R^i 
of a fixed agent i. Specifically we generalize the standard 
case to a situation in which — R^ 2 with probability 
Pi G [0, 1] for any given i and p (and ^ i?f 2 with 
probability 1 — Pi, recall that the i?f a take only values —1 
and 1). The joint probability distribution of and Rf 2 
is then given by 

P(R^ = x, R& = y) = | (4,-1^,-! + 5 s ,i6 VtL ) 

H ^ (S x ,-iS y ,i + S x ,i8y,-i) • (6) 

The standard situation of independent strategies is cov- 
ered as the special case pi = 1/2 for all players i. On the 
other hand, if pi = 1, then player i holds two identical 
strategy vectors, whereas for pi = he or she has two 
opposite strategies, -R^-i = — Ri,i- Allowing the corre- 
lation parameter pi to depend on i adds another layer of 
heterogeneity to the ensemble of agents: not only are their 
strategies chosen at random and hence are heterogeneous 
across the group of N players, but also the probability 
distribution from which they are drawn may be different 
for different agents. We will assume that each pi is ran- 
domly and independently drawn from a distribution P(p), 
so that, for example, P(p) = S(p— 1/2) corresponds to the 
standard game, with pi — 1/2 for all i. Some of our results 
presented below have been summarized in the short paper 
|22| . The statics of MGs with correlated strategies were 
first studied using replica methods in |2J. Numerical re- 
sults are also found in [221 , and in [53] , where an effective 
correlation between the assigned strategies was introduced 
upon biasing the {-Rf a } towards one of the binary entries. 

Let us finally, in this section, introduce the volatility, 
a 2 , of the market. It describes the variance of the total re- 
scaled market bid A, and can be defined as the following 
long-time average: 

a 2 = lim rTilf) 2 . (7) 

In on-line models the relevant average over the stochastic- 
ity of the information is to be performed, as A(£) depends 
on both the score valuations {qi(£)} and the value of the 



external information p(£). In deterministic batch games 
this average is replaced by one over p. 

p 

o 2 = lim r^^EW, (8) 

t<T 1-1=1 

where A»(t) = f2» + A^ 1 / 2 £\ fj* Sj(t). The volatility is a 

measure for the global efficiency of the market; if a 2 = 
supply and demand are always matched (A — 0) and trad- 
ing is fully efficient. Non-zero volatilities however indicate 
mismatches between the numbers of buyers and sellers, 
and hence imply inefficient markets. For the mathemati- 
cal analysis of batch MGs it is also convenient to introduce 
the volatility matrix 3 jS] 

p 

3tt> =P- 1 X^ / W(*0- ( 9 ) 

Note that a 2 = lirn r _» 00 r _1 J2t<r ^tu an d that stochas- 
tic trading with Sj(t) = ±1 taken randomly and indepen- 
dently at any time step t would result in S u > = ^(1 + Stt') 
in the thermodynamic limit. In particular a 2 = 1 is the 
so-called random trading limit. 

3 Generating functional and effective 
single-agent process for the batch game 

We will now turn to the generating functional analysis of 
the batch process (01 ■ The calculation is an extension of 
the analysis of the standard game presented in 0. 

3.1 Generating functional and disorder average 

The moment generating functional corresponding to the 
batch process of Eq. reads 

Z[i>] = J dqp (q(0))exp (i^f/>i(t)8i(t)j 

J\S Li(t+l)-qi{t)+Y^J ijSj (t)+hi-ei{t)j ,(10) 

where we have introduced the usual source term contain- 
ing the field variables {ipi(t)}. The {0i(t)} are additional 
perturbation fields, introduced to generate response func- 
tions, and po(q(0)) denotes the probability distribution of 
the initial score differences q(0) = (c?i(0), . . . , gjv(0)) from 
which the dynamics is started. 

The further analysis is standard and follows the lines 
of [fj. The average over the strategy assignments can be 
performed exactly in the thermodynamic limit TV — > oo, 
leading to a (quenched-strategy-) disorder averaged gen- 
erating functional Z[ij>], from which all dynamical observ- 
ables can be computed upon taking derivatives with re- 
spect to the {4>i(t)} and {9i(t)}, finally taking the limit 
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in which these generating fields go to zero. The relevant 
macroscopic order parameters of the resulting theory are 
the weighted disorder-averaged correlation and response 
functions of the original iV-particle problem: 



Off = lim 



TV ^ 1 



■( Si (t) Si (t')), (11) 



Gtt' 



lim 



_L \ " 1 ~ P* 

N I — p 



p t d(si{t)} 



P d0i(t>) 



(12) 



where (. . .) denotes an average over initial conditions (as 
specified by the distribution p o (q(0)). We use the notation 
~p = f dpP(p)p for the first moment of the distribution 
P(p) of correlation parameters 1 . While the numerator in 
the unusual pre- factors (1 — pi) /(I — p) is a direct conse- 
quence of the averaging procedure over the assignments of 
correlated strategies, the denominator is chosen to ensure 
that the equal time correlation function is equal to one, 
Ctt — l 2 - For later convenience we shall take the inverse 
characteristic time-scale r in the couplings Jij and the 
fields hi to be given by r = 1/(1 — p). 

In an cxtcntion of the procedure for uncorrelated agents, 
by introducing auxiliary macroscopic functions relateable 
to the correlation and response functions, performing sum- 
mations over the microscopic variables and utilising ex- 
tremal dominance in the limit N — > oo, the computation 
results in an equivalence of the original Markovian coupled 
iV-particle dynamics to an ensemble of non-Markovian sin- 
gle agent problems, explicitly involving the correlation and 
response functions C and G and subject to self-consistently 
determined coloured noise. 

Explicitly, the equivalence is to an ensemble of effec- 
tive single agents with characteristic labels p subject to 
stochastic dynamics 



perturbation fields 6i(t) for players i with correlation pa- 
rameter Pi = p are all identical, 9i(t) = 6 p (t), and that the 
initial values qi(0) for agents i £ {j = 1, . . . , N\pj — p} in 
the original TV-particle dynamics are all drawn indepen- 
dently from the same distribution Pop(<Zi(0)), i.e. that the 
distribution of initial values factorizes over agents with 
the same correlation parameter p. 

Note that the different single particle noises {r] p } (for 
different values of p) all have the same covariancc matrix 
Au' , with no explicit dependence on p, but only an implicit 
one on the distribution P(p) through the correlation and 
response matrices C and G. Nevertheless we will keep the 
subscript p in r\ p to distinguish the noise contributions to 
the effective single-agent processes for different values of 
p. The correlation and response functions Ctt 1 and Gu> are 
then to be computed self-consistently as two-fold averages 
over (i) the measure (. . . \p) r generated by the realizations 
of the process {q p (t)}: 

< f[l P ]\p >*= Dq p Dq p p o (q p (0))f[q p } 



q p (t + l) = q p (t)+e p (t) 



1-p 
z — -a 
1-p 



J2(K + G) u }s p (t>) 



t'<t 



1-P 



Sp (t) = sgn[q p (t)] (13) 



with fractional population P(p) and with {rj p (t)} Gaussian 
coloured noise of zero average and covariance 

At, = (v P (t)v P (t')) Vp = [(n+G)- 1 J D(U+G T )- 1 ] fts (14) 

where ll tt / = 8 tt > and G, G and D (D tt > = + C u > for 
all t, t') are determined self-consistently across the ensem- 
ble 3 . In the derivation of (|13fl we have assumed that the 

1 Note that the overbar on p has a different meaning from 
the other overbars, which indicate averaging over the quenched 
disorder choices. 

2 We assume here that ~p < 1, and exclude the case ~p = 1. 
The latter case corresponds to a situation in which all agents 
(except for a non-extensive number) have pi = 1, and hence 
hold two identical strategies. Such a game trivially leads to a 
volatility a 2 — 1, corresponding to the random trading limit, 
and exhibits no phase transition. 

3 A similar situation involving an ensemble of single agent 
processes has been encountered in |13| . where the author dis- 
cusses a population of agents trading with different frequencies. 



exp (iY^q p (t)(q p (t + 1) - q p (t) - 6 p (t))^ 
cxp (ia^J2q p (t)(n + G)-} Sp (t') 

5«Yr| E «p «K n + Gy'DtR + G T r V<7p(0 ) 



(15) 



and (ii) over the distribution P{p) of correlation parame- 
ters. In detail one has 



1-P 



1 


-p 


1 


- p 


1 


-p 


1 


- p 


1 


-p 


1 


- p 



-^ry < s p (t)\p >» 



dpP(p)—^G p (t,t'), 
1-p 



(16) 



(17) 



so that the different processes 111 -it (with different values 
of p) are effectively coupled through the matrices C and 
G, which appear in l|13|l and in the noise correlator (|14fl 
and which at the same time are (weighted) averages over 
the whole ensemble of representative agent processes. The 
original TV-agent dynamics and the self-consistent single 
agent problem are equivalent in the thermodynamic limit 
in the sense that disorder-averaged observables in the orig- 
inal problem (such as the correlation and response func- 
tions f| 1 1 1 112(1 ^1 are identical to the corresponding aver- 
ages obtained from the ensemble of effective single-agent 
processes (Eqs. (1161 117|) for the correlation and response 
functions) . This equivalence extends to other macroscopic 
observables, including the probability for a given agent to 
'freeze' (i.e. to use only one of his or her two strategies) in 
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the long-run. We will use the resulting 'fraction of frozen' 
agents below in the further analysis of the effective agent 
dynamics. 

Due to the presence of the coloured noise {i] p (t)} and c 
the retarded self-interaction term in Eq. Q13f) a direct so- 
lution of the self-consistent problem defined by I|13ll7fl is 
in general impossible beyond the first few time steps. An 
analysis of possible ergodic time-translation invariant sta- 
tionary states is however feasible, and will be presented be- 
low. Alternatively one can resort to a numerical iteration 
of the representative agent problem using a method first 
proposed in 22] ■ This effective Monte-Carlo integration 
of the single agent problem allows one to determine the c 
correlation and response matrices C and G to arbitrary 
precision without finite-size effects, but becomes more and 
more costly computationally as the number of time steps t 
is increased (due to required inversions of matrices of size 
t x t at time step t). 

This direct-iteration procedure does not require any 
further assumptions on the properties of the dynamical 
order parameters, and can be carried out for all values of 
a in both the ergodic and non-ergodic regimes of the game. 
In Fig. H we display results for the volatility as a function 
of a for unimodal distributions of the correlation param- 
eter, pi — p for all i. We find good agreement between 
the data obtained from a direct simulation of the batch 
process (open symbols in Fig. ^) and from an iteration of 
the effective single-agent process (solid markers), respec- 
tively. This confirms the validity of the analytical theory 
derived in this section for general P{p) in the special case 
of unimodal distributions of strategy correlations. 

We will now proceed to analyse the single-effective 
agent process further and to compute persistent order pa- 
rameters in the ergodic stationary state, as well as the 
phase diagrams of batch MGs with correlated strategies. 

3.2 Persistent order parameters in the ergodic 
stationary state 

In order to analyse the ergodic state in the regime of large 
a we will make the following assumptions: (i) the sys- 
tem becomes stationary in the long-time limit, i.e. we will 
assume temporal translation invariance, lim^oo Ct+ T ,t — 
C(t) and lim^oo Gt+ T ,t = G(t), (ii) the absence of anoma- 
lous response, i.e. we will assume that the integrated re- 
sponse x — li m t^oo J2 T <t ^( T ) remains finite and (iii) 
weak long-term memory, i.e. lim^oo Gtv = for all fi- 
nite t' . 

The integrated response \ along with the persistent 
part of the correlation function, c = lim^oo i _1 X)r<t C{t), 
will be the order parameters characterising the ergodic 
stationary states of the game. The further analysis hence 
proceeds by formulating closed equations for these persis- 
tent order parameters. 

Following 6| we introduce the re-scaled quantity q p {t) = 
q p (t)/t, and find from 1)13(1 

q p (t) = - t q p (0) - iz£| £ £(H + G)-,l„s p {t") 
" t'<t t" 




a a 



Fig. 1. Volatility as a function of a for different values of p 
for the batch game (pi = p Vi). Open symbols are from simu- 
lations of the batch process with N = 1000 agents, averaged 
over 10 samples of the disorder, and run for 500 batch time 
steps. Filled markers are results from a direct iteration of the 
single-particle problem using the method of |25| (5 x 10 4 re- 
alisations of the single-agent process are generated, and the 
iteration is performed for 100 time steps). Circles: tabula rasa 
start (gi(0) = 0), squares: biased starts (|<7i(0)| = 1). Solid 
lines are the approximation of l|M40 for the ergodic phase and 
are continued as dashed lines into the non-ergodic phase (where 
the ergodic theory is longer valid). 

+*,+^a/^|E^') ( 18 ) 

" " t'<t 

Here, we have assumed that 9 p (t) = 9 P for all t, so that 
9 P is a static perturbation. Now, upon taking the limit 
t — > oo and writing q p = limt_,oo q p (t), we find 

q P = -a\ — =-rf— + e P + \ a \ — =V P , (19) 
i -pl+x V l-p 

where s p = lim^oo t' 1 J2 T <t s S n [9p( T )] and 
ri p = limt_ >00 t^ 1 J2 T <t ^pM- Note tliat <7p> s p an d Vp are 
random variables, coupled through Eq. (|19fl with each re- 
alization corresponding to a realization of the single agent 
process (|13f) . The variance of the zero-average Gaussian 
variable rj p can be obtained from i|14[> and reads 

(v 2 P ) = iim ^££p + o- 1 D(n + G r )- l ]«, 

' ip t.t'— >oo TT '—^ L — 4 

t<T t'<r' 



The analysis now proceeds along the lines of [S] , and we 
distinguish between so-called 'frozen' and 'fickle' agents. 
The distinction between frozen and fickle agents was first 
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introduced in |26| and is based on observations from nu- 
merical simulations of batch and on-line MGs. One finds 
that some of the trajectories Qi(t) of the original dynamics 
grow linearly in time in the stationary state, qi(t) ~ t, so 
that the corresponding (frozen) agent always employs the 
same strategy. Other (fickle) agents keep switching strate- 
gies, and their score difference remains finite. While the 
distinction between frozen and fickle agents was originally 
made on the level of the initial TV-particle dynamics, the 
same type of trajectories are also found in the realizations 
of the effective single particle process (generated for ex- 
ample using the method of [23)- Frozen effective agents 
can be identified as those realizations of for which 
q p ^ 0. One then has s p = sgn[<j p ]. Using Eq. (fP3j) (set- 
ting dp = 0) this is seen to be the case if \r) p \ > j p , where 

7 P = \J a ^~-?^ - On the other hand, a given realiza- 
tion of the representative agent process is 'fickle' when 
the score-difference q P (t) fluctuates around a finite value, 
with occasional zero-crossings, then one has q p = 0. This 
is the case if \r) p \ < 7 P , and then s p — rjp/jp. Note in par- 
ticular that for fickle (effective) agents s p is a continuous 
variable, — 1 < s p < 1, with a non-zero value indicating 
that the corresponding effective agent employs his or her 
strategies with different frequencies. 

From this the fraction of frozen agents with strategy 
correlation p can be computed by performing the Gaus- 
sian integral over r\ p (with variance given by l|2U|) '): 




Fig. 2. Persistent part c of the correlation function for the 
batch game with pi — p for all i and with tabula rasa initial 
conditions. Connected symbols are data obtained from simula- 
tions for TV = 300 players, measured over 500 batch steps, pre- 
ceded by an equilibration period of 500 steps. The curves are 
p = 0.9, 0.8, . . . , from top to bottom. The solid lines are the 
theoretical predictions for the ergodic regime and have been 
continued as dashed lines by extrapolation of equations 1271 
!2iSt into the non-ergodic phase below a c , where they are no 
longer valid. The slight discrepancy between the theoretical 
lines and the numerical data close to the predicted breakdown 
of the ergodic theory is due to finite-size effects. 



<f>(p) = (6(\v P \-7 P )\p) 



where we have introduced 



Hp) 



1 q(i-p) 

p+(l-p)c 



(21) 



(22) 



9 in (f2*T|) is the step function; 9 (x) = 1 if a; > 0, and 
9(x) = otherwise. The corresponding persistent part of 
the correlation function reads 

c(p)= km i-^Cpto 

T<t 

= (4\ 



i - 4>Q>) i 
Hp) 2 Hp) 



exp i 



-A(p) 2 /2)(23) 



(sp)^ may therefore be written as 



X(P) 



/8s f 



k(1 - p) \<9?7p 



a(l-p) 



(24) 



From these equations the overall persistent correlation c 
and susceptibility x can be computed self-consistently as 



dpP{p) l —^c{pl 
1-p 

X = I dpP(p)^—^x(p)- 



(25) 
(26) 



To this end, note that inserting (|2U and into (|2*3|) 
allows one to express c{p) in terms of c (and the model 
parameters a and p). 



Applying a static perturbation 9 p in (|19f) is (up to a prefac- 
tor yj a(l — p)/(l — p)) identical to perturbing the static 
random variable r\ p . Such perturbations applied in the 
stationary state affect only fickle agents, for which we 
have s p = n P h P - The susceptibility \(p) = Y, T G p( T ) = 



4 4>(p) is the probability for an agent with strategy correla- 
tion p to be frozen. 



4 Unimodal distribution of strategy 
correlations 



We will now first investigate the case of a unimodal dis- 
tribution of strategy correlations, P(p') — S(p — p 1 ), i.e. 
the case where all pi are equal, pt = p, with < p < 1 . 
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4.1 Stationary state in the ergodic regime and phase 
diagram 

In this case the above self-consistent set of equations (|2"5l 
126(1 for the order parameters c and x can be compactified 
considerably using equations i|22U23ll2"l|l and we find (with 
c = c(p), <j) = (f>(p), x = x(p) and A = A(p)): 



,-A 2 /2 



A V 7T 

a 

^(5 



where 



A = 



l-p 



+ c 



(27) 
(28) 

(29) 



Note that p = p in the case of unimodal P(p') = 5(p — p'). 
The fraction of frozen agents is given bycj> = l-erf(A/^2). 
Eqs. H27fH28|) agree with the corresponding equations for 
the static order parameters obtained within a replica sym- 
metric theory [2]. They are easily solved numerically, and 
exact analytical predictions for c, <p and x as functions of 
a can be obtained for < p < 1. Results for the persis- 
tent correlation c are shown in Fig. |2 and we find very 
good agreement with numerical simulations for large a, 
greater than critical values a c (p). The deviations at lower 
values of a — P/N < a c (p) are due to a breakdown of 
the ergodic theory, more precisely of the assumption of 
finite integrated response [[J. The point a c {p) at which 
this happens can be computed from Eq. i|28|) . One finds 
that x ^ oo at a critical value of a = a c (p) given by 
a c (p) = erf(A c /\/2), where A c = \ c (p) fulfills 



erf 



V2 



1 



7T \ c (p) 



-A c (p) 2 /2 



(30) 



Solving this equation numerically leads to the phase dia- 
gram depicted in Fig. El with the critical line in the (a, p) 
plane separating the ergodic phase (x finite) and the non- 
ergodic phase. This phase diagram for the MG with diver- 
sified strategies was first obtained from the corresponding- 
replica calculation in 2 . While macroscopic observables 
such as the volatility are sensitive to the initial conditions 
in the non-ergodic phase, a < a c (p), the starting point is 
irrelevant in the ergodic phase, a > a c (p). We display the 
volatility for so-called tabula rasa starts (qi(0) — for all 
i), and for randomly biased starts (|gj(0)| = qo = O(l)) 
in Fig. [1] 



4.2 Analytic approximations for the volatility 

As we have seen it is possible to compute macroscopic or- 
der parameters of the stationary state, such as the fraction 




Fig. 3. Phase diagram for the batch minority game with uni- 
modal distribution of strategy correlations, P(p') = S(p' — p). 



of frozen agents or the persistent part of the correlation 
function, exactly and in good agreement with simulations 
in the ergodic regime for general values of p. We will now 
turn to the magnitude of the market fluctuations a 2 . By 
performing a direct average in the generating functional, 
it was shown in [H] that the disorder-averaged volatility 
matrix S is proportional to the correlator of the single- 
particle noise in the effective agent process. Generalizing 
the results of [S] to arbitrary values of p we find the fol- 
lowing exact result: 

S tv - (1 - p)A tt , = {l-p) [(II + G)- 1 Z?(U + G T )~ 1 ] w . 

Thus, an exact calculation of the volatility requires the 
computation of this matrix convolution, and hence the 
knowledge of both the long-term and transient behaviours 
of the dynamical order parameters G(t) and C(t) in the 
stationary state. To this end a full computation of the 
time-translation invariant solutions of the self-consistent 
representative agent problem would be necessary. Due to 
the retarded self- interaction and the presence of the coloured 
single-particle noise, this is in general impossible. One 
therefore has to resort to approximations, and the gen- 
erally accepted approach first presented in |Hj aims at ex- 
pressing the market volatility in terms of the persistent 
order parameters c, <f> and x- 

To this end one separates the contribution of the frozen 
agents to the correlation function C from the contribution 
of the fickle agents and writes 



G(t-i') = 0+(l-0) (s P (t)s p (t')) 



fi ' 



(32) 



where the average (. . extends only over fickle agents. 
Inserting this into l|31(l and recalling 



linw 



p 
i-p 



T 1 J2t<T-tt leads to 
(1-0) 



(1 + X) 



x lim r 

T—*00 




fi 
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In [Jj], for p — 1/2, the authors proceeded by keeping only 
the instantaneous contribution, t = t , in the last term. 
Generalizing their calculation to arbitrary < p < 1 one 
obtains the following approximate expression 



a 2 = (l-p) 



P/{1-P) + <P 

(i + x) 2 



(1 



(34) 



Fig.^demonstrates that this approximation is in good 
agreement with data from simulations of the original batch 
process for p > 1/2, but that it becomes less accurate as 
p is reduced. In particular we note that in simulations we 
find volatilities a 2 > 1 in the ergodic phase of batch games 
with largely anti-correlated strategies, i.e. low values of p, 
while Ij34() predicts a volatility below the random trad- 
ing limit a 2 — 1 for all values of p and a > a c (p). We 
have verified explicitly that the discrepancy between this 
analytical approximation and the numerically measured 
volatility can be traced back to the omission of the non- 
instantaneous terms in 133fl : iterating the single-particle 
process using the Eissfeller-Opper algorithm [35], measur- 
ing the contributions from the frozen and fickle agents, 
respectively, and taking into account all terms of (|33|1 re- 
stores the agreement with direct numerical simulations of 
the batch process. The discrepancy between the approx- 
imation of Eq. I|34|) and the volatility measured in simu- 
lations becomes extremal in the fully anti-correlated case. 
In this case one finds no frozen agents, and inserting p = 
and (f> = into (|3"P gives a 2 = 1 for a > a c (p — 0) = 1. 
In numerical simulations of the batch process, however, 
we find a decreasing relation a 2 = a 2 (a), and a 2 = 1 is 
only approached in the limit a — > oo, see Fig. In the 
next section, we will study the case of fully anti-correlated 
strategies in more detail, and will derive different analyt- 
ical approximations for the global market fluctuations in 
this case. 



4.3 The fully anti-correlated case 

In the special case of completely anti-correlated strategies, 
p = 0, and tabula rasa starts, we observe experimentally 
that the system is always in a fully oscillating phase, i.e. 
that all agents switch strategies at each time step, and that 
accordingly C(r) = (— 1) T for all r. Eq. (|2T)l for p = is 
indeed solved by c = so that no frozen agents are possi- 
ble. We will use this observation as an ansatz, which will 
allow us to proceed analytically and to give an approxi- 
mate expression for the volatility, which is different from 
the one (derived for general p) in the previous section. 
Its derivation is similar to the analysis of the non-ergodic 
state in the limit a — > of the standard batch MG [H]. 
In this limit of the game with uncorrelated strategies a 
similar oscillatory state is found. 

For p — 0, we have D(t) = C(r) = (— 1) T and us- 
ing Eq. (|31|l . we conclude that the correlation matrix of 
the single-particle noise in the stationary state is given 
by A tt , = {-ly-t'T 2 , where Y = £ T (-l) r (lI + G)~ x (t). 
Thus, for any fixed realization of the single-particle noise 
rjo(t) must be of the form rjo(t) — Yz(— 1) , where z is 



10' 
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Fig. 4. Volatility as a function of a for p = for the 
batch game. Connected symbols are from simulations started 
from different initial conditions (circles are tabula rasa starts, 
|gi(0)| = 0, squares are |gi(0)| = 2.0 and diamonds |§i(0)| = 
10.0). The solid and dashed lines are the approximations 
of Eqs. 141H and ll 12t respectively. The vertical dot-dashed 
line marks the analytically predicted location of the phase 
transition at a c (p — 0) = 1. Inset: a 2 vs a for p = 
0.0,0.01,0.02,0.03,0.04,0.05 from top to bottom {tabula rasa 
starts) . 

a (static) Gaussian random variable of zero mean and 
unit variance (note that the subscript in t]q (t) indicates 
that we are concerned with the case p = in this sec- 
tion, similarly for qo(t) and so below). A given stochastic 
value of z determines the oscillation amplitude and sign 
of the corresponding trajectory of the noise r] (t). The 
volatility in this fully oscillatory state can be obtained as 



lilH7 



t<T An 



T , so that it remains to 



compute Y. Upon introduction of 

So = lim t- l > Y-lJ'sgnfooM] 



t<T 



(35) 



we can use the identity 

V^E Mt)vo(t")) Vo Gt't" - (sgnMOM*))^ (36) 
t" 

(obtained by an integration by parts in the generating 
functional 6 ) to write Y as 

r = 1 - 4= (soz) z , (37) 

where (. . .) z denotes an average over the standard Gaus- 
sian variable z. We have also used the fact that ^ T {— l) r (U- 
G)-\t) = [l + E.Mrar)]- 1 here®. 

In order to compute the average (§oz) z , we note that 
from the effective single-agent process we can derive the 

relation 

Aq = -aYs + ^/afj (38) 

between the staggered averages Sq and 

Ago = lim r- 1 y)(-l)*(9o(* + 1) - <?o(*)), (39) 

fj = lim 7-- 1 £(-!)*»»(*)• (40) 

t<T 
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Fig. 5. Amplitude of oscillations of the correlation function as 
a function of a for the batch game for p = 1/2 (filled markers) 
and p = (open markers). Results are from simulations with 
N = 500 agents, run for 500 batch steps and averaged over 
10 realisations of the disorder. In the simulations we measure 
ci in the stationary state as ci = (2T) _1 X^t=t |C( r + 1) — 
C(r)|, where T = 125 batch steps. Circles are data for tabula 
rasa starts, squares from simulations with biased starts (qo = 
0.5). The vertical dashed lines mark the locations of the phase 
transition of the games with uncorrelated strategies at a c (p = 
1/2) « 0.3374 and at a c (p = 0) = 1 for fully anticorrelated 
strategies. 




Fig. 6. Persistent part c of the correlation function for the 
on-line game with pi = p for all i and with tabula rasa initial 
conditions. Connected symbols are data obtained from simula- 
tions for N = 300 players, measured over 25000 on-line steps, 
preceded by an equilibration period of 25000 steps. The curves 
are p = 0.9, 0.8, . . . , from top to bottom. The solid lines are 
the theoretical predictions for the ergodic regime and have been 
continued as dashed lines by extrapolation into the non-ergodic 
phase below a c , where they are no longer valid. The slight dis- 
crepancy between the theoretical lines and the numerical data 
close to the predicted breakdown of the ergodic theory is due 
to finite-size effects. 



Since the effective agents are purely oscillatory, one has 
sgn(Z\qo) = —so. Using Eq. (p?%f and the relation r]o(t) = 
Tz{— 1)* we therefore require sgn[z] = — sq, i.e. s$z = —\z\ 
for any given realization of z with ^fST\z\ > aT (and the 
corresponding realization of the effective agent process). 
Since T comes out positive this is the case for realiza- 
tions with \z\ > ^/a. For \z\ < ^fa both signs sqz = ±|2| 
are possible; using the Eissfeller-Opper algorithm we have 
checked that indeed both signs are taken in this case. A 
similar complication has been encountered in the context 
of mixed Minority/Majority Games in |12|. 

We have two possibilities to proceed: firstly, we can 
assume that sqz = — \z\ also for —y/a < z < ^/a, and 
that deviations from this behaviour will only have a small 
effect on the volatility. We then end up with the following 
approximation 

^( 1 + _L (w) /f 
= ( 1 + ^) 2 - < 41 > 

Alternatively, we can assume that both signs §qz = ±|z| 
are taken equally often for \z\ < y/a so that this interval 
does not contribute to the average (s z) z . We then find 

As depicted in the main panel of Fig.^the approximations 
of 1)41(1 and 1(42(1 appear to form upper and lower bounds of 



the volatility, respectively. Both approximations become 
exact only in the limit a — > 0, but reproduce the behaviour 
of a 2 = er 2 (a) quite well, even for larger values of a m 1. 

We conclude this section by mentioning that the batch 
MG with fully anti-correlated strategies, p — 0, is differ- 
ent from the cases < p < 1 in two respects: (i) the 
fully anti-correlated case exhibits oscillatory behaviour, 
C(r) = (— 1) T , for all a, even above a c (p = 0) = 1, 
whereas in all other cases oscillations decay above a c and 
persist only in the non-ergodic phase, where one finds cor- 
relation functions of the form C(r) = cq + ci(— 1) T for 
r > 5 . This is illustrated in Fig. where we plot c\ as 
a function of a for different values of p; (ii) the volatility 
plotted as a function of a exhibits a minimum at a c {p) 
for all < p < 1, but is decreasing monotonically for 
p = 0, see the inset of Fig. Nevertheless the station- 
ary volatility in the fully anti-correlated case does depend 
on initial conditions below a c (p — 0) = 1, whereas the 
starting point is irrelevant for a > 1 . 



4.4 Comparison with the on-line game 

We will now turn to the case of on-line learning rules, and 
will discuss the influence of the timing of adaptation of the 
agents on the behaviour of MGs with correlated strategies. 

5 Below a c we find C(r) = Co + ci(— 1) T for r > within the 
experimental accuracy and with positive fitting parameters Co 
and ci. We observe that Co + c\ < 1 unless the case p — 
or the limit a — > is considered. Trivially one always has 
C(r = 0) = 1. 
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'_ Fig. 8. Volatility as a function of a for the on-line game at 

j p = 0, and for different values of the initial bias |gj(0)|. All 

J | data is from simulations with TV = 300 agents averaged over 

10 10 samples of the disorder. 



Fig. 7. Volatility as a function of a for different values of p 
for the on-line game {pi = p for all agents). Solid lines are the 
approximation of l|44|l for the ergodic phase and are continued 
as dashed lines into the non-ergodic phase (where the ergodic 
theory is no longer valid) . Markers are from simulations of the 
on-line process with TV = 300 agents (TV = 1000 for p = 0.9), 
averaged over 10 samples of the disorder, and run for 10 5 on- 
line time steps. Circles: tabula rasa start (<Ji(0) = 0), squares: 
biased starts (|gi(0)| =3). 

A generating functional theory for the on-line game 
with random external information and uncorrelated strate- 
gies has been worked out in [23- The analysis has to deal 
with the explicit dependence on the external information, 
and inevitably requires a much more elaborate mathemat- 
ical formalism than the analysis of the batch game. The 
resulting theory is easily adapted to the case of general 
strategy assignments with arbitrary correlations. We will 
not report the mathematical details of the calculation, but 
will only state that, in the ergodic regime, it leads to self- 
consistent equations for the persistent order parameters 
(such as c, <p and x), which are identical to the ones of the 
batch MG with general strategy correlations. As a conse- 
quence, the location of the phase transition (marked by 
an onset of diverging integrated response) is identical in 
batch and on-line games with the same distribution P(p) 
of correlation parameters. Results for the persistent corre- 
lation c as a function of a are compared with simulations 
of the on-line game with unimodular strategy correlations 
in Fig. El The on-line simulations shown in Fig. essen- 
tially also match those of the batch case shown in Fig. [JJ 
for the ergodic region. 

Numerical simulations, however, reveal crucial differ- 
ences between the volatilities of the on-line and batch 
games with correlated strategies. A comparison of Figs. 
2] and [7J shows that in their ergodic states the volatility 
of batch games can take values both above and below the 
random trading limit a 2 = 1 , whereas that in on-line 
games never exceeds a 2 = 1. The difference between the 



volatilites of batch and on-line games becomes maximally 
pronounced in the fully anti-correlated case, compare Figs. 
0] and In the on-line game with p = we find that the 
volatility is equal to one for all a > a c (p = 0) = 1, whereas 
it approaches this random trading limit only asymptoti- 
cally for a — > oo in the batch case. Allowing the agents 
to switch stategies only every M time steps according to 
the update rule J2J one can interpolate between the batch 
and on-line cases 6 : Fig.Elshows the volatility a 2 as a func- 
tion of a for the fully anti-correlated case for different in- 
termediate choices of M. While small values of M <C P 
produce market fluctuations close to those of the on-line 
game (which corresponds to M — 1), the batch limit is 
obtained for M ><D{P). 

Similarly to the batch case, a computation of the tran- 
sient parts of the dynamical order parameters would be 
necessary to obtain exact expressions for the volatility 
of on-line MGs. As this is in general not feasible, one 
proceeds as in the batch case, and tries to find approx- 
imate expressions in terms of the persistent order param- 
eters only. Such an approach was first proposed for on-line 
games in 1211 , and is slightly different from the approxima- 
tions in the batch case. The approximation is based on the 
observation that in mean-field disordered systems with de- 
tailed balance fluctuation-dissipation relations can be used 
to 'transform away' non-persistent parts of the response 
and correlation functions without changing averages of 
single-time observables and averages of two-time quanti- 
ties at infinite time-separations. For equilibrium systems 
with detailed balance this approach is exact [27J ■ It can be 
used to extract the static order parameters as obtained in 
a replica-symmetric equilibrium approach from the gener- 
ating functional equations. Details of this procedure can 
be found in [27025] . 

Although the MG does not exhibit detailed balance a 
similar ad-hoc procedure for removing the non-persistent 
parts of the order parameters has been successfully em- 

6 Note also that some work on mixed populations of agents 
with individual updating frequencies Mi is currently being fi- 
nalised by other authors |29|. 
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Fig. 9. Volatility of a MG with fully anti-correlated strategies 
(pi = for all i) and in which switching of strategies is allowed 
only every M time steps. The individual curves are for different 
values of M. All data is from simulations performed at a con- 
stant number P = aN = 50 of possible states of the external 
information p and were run for 10 6 on-line steps. Results are 
averages over 20 realisations of the disorder. The corresponding 
curves for the on-line and batch games are displayed as filled 
symbols for comparison. The lines connecting the markers are 
guides to the eye. 

ployed to obtain approximations for the volatility in |21| . 
It amounts to assuming that C and G are of the following 
form: 



C tv 



<l-c)S t 



G, 



x*(i- 



-Sy- 1 '- 1 (for t > t'), 
(43) 

with the limit 6 — > to be taken at the end of the calcu- 
lation. In this way one preserves 

limr—nx, C(t) = c and ^ r G(r) — Xi but removes non- 
persistent contributions. 

This approach is easily generalised to the case of uni- 
modular distributions of the correlation parameter, and 
upon making the standard assumptions on time-translation 
invariance and ergodicity, one obtains the following ap- 
proximate expression for the volatility 



(1-P) 



(i + x) 2 +(1 " c) 



(44) 



which differs only slightly from the approximate result for 
the batch game, Eq. (|34|l . As shown in Fig. this approx- 
imation is accurate in the ergodic phase for all values of p, 
even in a regime where the corresponding approximation 
for the batch MG is not satisfactory to describe the volatil- 
ity measured in numerical experiments. We attribute the 
deviations slightly above the transition to finite-size ef- 
fects. Applying the same method to the volatility of the 
batch game also leads to Eq. I|44l) and hence to the same 
qualitative and quantative discrepancy between the nu- 
merically measured and analytically estimated volatilities 
in batch games with largely anti-correlated strategies. See 
also [201 for alternative derivations of Eq. (I44|) for batch 
and on-line games with uncorrelated strategies. 

Eq. Q44H agrees with the approximation for the volatil- 
ity derived from the replica analysis of the MG with di- 



versified strategies 0. We note that the analogue of the 
removal of the non-persistent parts of C and G in the 
dynamical approach is reflected by an assumption on the 
cross-correlations between the agents in the replica for- 
malism of |2] • Based on the assumption that (siSj) = 
(si) (sj) for i 7^ j the authors of [2] neglect a term A = 
N ^ 1 il ll L E^C^((^ s i)-( s ») ( s j))> in the ergodic phase 
(a > a c ). Here again Si(t) — sgn[c^(i)], and (. . .) denotes 
an average over time. While our findings concerning the 
volatility suggest that this assumption is appropriate in 
the on-line game, it appears to be inaccurate in the batch 
game at low values of p. We have confirmed this numeri- 
cally in simulations of the batch MG. At fixed a > 1 we 
find that A = A(a,p) is close to zero for large values of 
p ~ 1, but increases as p approaches zero. While we do 
not depict these results here, we will only point out that 
for p = we find oscillatory behaviour, Si(t) — Sj(0)(— 1)*, 
at all a, so that (sj) = 0, but (SiSj) = ±1. One then has 
(s{Sj) — (sj) (sj) — ±1 ^ so that the above approxima- 
tion fails. 

To conclude the discussion of the volatility in on-line 
models, we would like to mention that the volatility of 
MGs with real histories and fully anti-correlated strategies 
behaves qualitatively like the one of the on-line MG with 
random history, with a 2 — 1 for a > 1 

Finally, in this section, let us briefly address the role 
of global oscillations in on-line MGs. As an analytical 
treatment of this dynamical feature would require a so- 
lution of the transient behaviour of the dynamical order 
parameters (which is still awaited) the results in the re- 
mainder of this section and in section 14.51 are all based 
on direct numerical simulations of the games under con- 
sideration. Anti-persistent behaviour in the on-line MG 
with uncorrelated strategies and with real market histories 
was identified first in :2Sjj where the authors find oscilla- 
tory behaviour below a c , but no persistent oscillations in 
the high-a phase, in qualitative agreement with results for 
batch games [H] , recall also Fig. [5] In |21] the authors con- 
sidered an on-line version of the MG in which the agents 
use only the sign of the total bid to update their strategy 
scores, i.e. a model, in which A(£') in Eq. is replaced 
by 7r(£') — sgnL4(£')]. We will refer to this as the 'sign- 
update' rule, as opposed to the linear relation Q. The 
authors then consider a conditional correlation function 



CW(r) = P 



p 



■k{1)tt{1 + t)\p) 



(45) 



in the stationary state, where the average (tt(£)tt(£ + r)\p) 
extends only over times I and £ + r for which p,(£) = 
p(£ + r) = fi. Similarly we can define a conditional 'spin- 
spin' correlation function according to 

p N 

C ss (r) = (NP)- 1 Y, E ( s ^( £ + T )\ti ■ ( 46 ) 

f-L—l i—1 

Fig. demonstrates that the games both with un- 
correlated strategies (p = 1/2), and with anti-correlated 
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Fig. 10. Correlation functions for the on-line game with real 
market histories. All data are from simulations at P = aN — 
128, run for 10 on-line time steps. Left column: p = 0.5, the 
oscillatory curves are at a — 0.1, below the transition, non- 
oscillatory curves at a = 1.25, above the transition. Right col- 
umn: p — 0.0, oscillatory curves a — 0.1, non-oscillatory curves 
a — 1.58. The upper two panels show the 'spin-spin' correlation 
function, C ss (r), conditional on the information pattern, the 
lower two panels the conditional correlation function, C W7r (r), 
of the sign of the total bid n(£) — sgn[A(£)] (see text for fur- 
ther explanantions) . Averages over 10 to 100 realisations of 
the disorder are taken. Results for C V1T at p = are displayed 
as running averages over 64 points to smoothen the noisy raw 
data. 



strategies (p = 0), respectively, show anti-persistence in 
their non-ergodic states, a < a c (p), but not above a c : 
Ctttt and C ss exhibit oscillations of period 2P below a c , 
but approach a constant value above the transition. In 
particular the behaviour of the on-line game with anti- 
correlated strategies appears crucially different from its 
batch counterpart in this respect: in the batch game with 
anti-correlated strategies we find oscillatory behaviour for 
all values of a, whereas in the on-line case with real his- 
tories and sign-updates, they are found only in the low-a 
phase. 

Before turning to the next section, we would like to 
remark that the oscillations below a c , first reported in |2~6"] . 
are generally not detected very easily in on-line games: 
we have tested several other variations of the model and 
observables and found that no oscillatory behaviour can be 
observed if (i) the linear update is used instead of the 
sign-update prescription, (ii) if unconditional correlation 
functions are considered instead of conditional ones or if 
(iii) real market histories are replaced by fake histories. 

While no persistent oscillations are found in uncondi- 
tional correlation functions of on-line games with random 



a 

Fig. 11. Amplitude of oscillations of the (unconditional) spin- 
spin correlation function of the on-line game with adaptation 
allowed only every M time steps. Oscillations are of period 2M. 
All data is from simulations, the lines connecting the markers 
are guides to the eye. Solid symbols: p — 1/2 (squares M — 
10P, diamonds M = 25P, circles are the data for the batch 
limit, plotted for comparison, see also Fig. [S} - Open symbols: 
p = (diamonds M = 25P, right triangles M = 100P, left 
triangles M = 1000P, circles are the data for the batch limit). 
All simulations are performed at P = 50 and run for 10 6 on- 
line steps, averages over 20 realisations of the disorder are taken 
|32|. Stars are the data for randomized updating (p — 0, M = 
10P, see Fig. 1121 for simulation details). No oscillations are 
found in this case. 



histories, oscillations emerge gradually for a < a c (p — 
0.5) in the case of uncorrelated strategies, and for all a in 
the anti-correlated case, as the time lag M between two 
successive strategy updates is increased to approach the 
batch limit, see Fig. (oscillation amplitude for on-line 
games is not shown, but is indistinguishable from zero on 
the scale of Fig. El- 



4.5 Random timing of adaptation 



Finally we have performed numerical simulations on MGs 
with asynchronous, random timing of adaptation. Choos- 
ing M > 0{P) and allowing all agents independently and 
randomly with probability 1/M to update their strategy 
preferences at a given on-line step, while still updating 
their score difference at each step, generates a batch-like 
model with asynchronous updating. As depicted in Fig. 
^2 the randomization of the updates removes the oscilla- 
tions in the spin-spin correlation functions of the batch 
game. The corresponding volatilities are virtually identi- 
cal to those of the corresponding on-line games above a c , 
in particular we find a 2 — 1 for a > 1 in the game with 
random updating and full anti-correlation, see Fig. 1121 In 
the non-ergodic phase the effect of the stochastic updating 
is a gradual reduction of the volatility. 
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5 Mixed population of speculators and 
producers 

The theory of section|3allows one in principle to study the 
batch MG for an arbitray distribution P(p) of correlation 
parameters. While the previous section dealt only with 
the case of unimodular distributions, we will now use the 
above formalism to study a mixed population of 'specula- 
tors' and 'producers' While a speculator is defined as 
a normal agent (holding two strategies), a producer is an 
agent with limited choice and has only one single strategy 
at his or her disposal (or equivalently two identical strate- 
gies, corresponding to full correlation, p — 1). A detailed 
analysis of the statics of games with such mixed popula- 
tions is contained in In this final section before our 
conclusions we will complement this work by a study of 
the dynamics of such models, and will demonstrate that 
the dynamical theory reproduces the results of the static 
replica analysis. The interplay of producers and specula- 
tors is also discussed in a different context in where 
so-called grand-canonical MGs are considered 7 ; see also 
PU) for further details. 

We will consider an ensemble of N agents, consisting of 
(1 — x)N speculators (with correlation < p < 1 between 
their strategies) and xN producers, where < x < 1. This 
corresponds to a choice 



P(p') = (l-x)5(p'-p) + x5(p'-l) 



(47) 



in the above generating functional calculation. As before, 
the parameter a = P/N is defined as the ratio between 
the number of patterns and the total number of agents. 
Using Eqs. (|23> and (J2EI) we find 



< = ~ — =(1 - x)c(p) 



X = 



1-/3 
1-P 



1-P 



-(1 - x)x(p), 



(48) 
(49) 



where ~p = (1 — x)p + x. c and \ are then determined 
self-consistently using expressions (12311 and (|24[) for c(p) 
and x(p)- We have checked and confirmed these analytical 
results for c against simulations for some choices of the 
model parameters, but do not report the numerical data 
here. After some more algebra we find 



a/(l - x) 
erf (A/V2) 



- 1 



(50) 



where A is fixed as the root of the equation 



2 e 



-A 2 /2 



a/(l 



A 2 



1 



(l-p)(l- x )- 



(51) 



7 In the grand-canonical games studied in | 31| speculators 
are agents who hold only one strategy (as opposed to two in 
the present paper), but in addition they have the option not 
to play at a given time step. Producers are agents with one 
strategy, but who play at every time step. 



From this we locate the onset of diverging integrated 
response as 

A c 



a c = (1 — x)eri 



V2J' 



where A r solves 



2 / Ac 

- erf — — 



7T Xr 



(1-pXl-z)' 



(52) 



(53) 



Note that cv c /(l — a;), depends only on the combination 
(1 — p)(l — x). The resulting phase diagram is depicted in 
Fig- El The above equations obtained from the generating 
functional analysis agree with the corresponding results 
from the replica calculation, as presented in 2 8 . 

Finally the generating functional approach allows one 
to study the influence of decision noise on the phase dia- 
gram and behaviour of MGs ^B] . A corresponding calcu- 
lation for mixed populations of producers and speculators 
shows that multiplicative noise generally reduces the value 
of a c /(l — x), i.e. that the range of the ergodic phase is 
increased when the trading decisions of the producers are 
made stochastically to a certain degree (35- This shift of 
cv c /(l — x) becomes more pronounced as the noise level 
is increased. A similar effect was previously also observed 
in MGs in which the agents trade on different time scales 



6 Concluding remarks 

We have presented an analysis of the dynamics of minority 
games with diversified strategies. Generating functional 
techniques can be used to turn the coupled dynamics of 
N interacting agents with heterogeneous strategy corre- 
lations into an effective single-particle problem. The gen- 
eral case of heterogeneous correlation parameters pi of the 
population of agents (drawn from a distribution P(p)) is 
reflected in the fact that one finds an ensemble of single 
agent processes as the final outcome of the theory, as op- 
posed to just one effective single-agent process for the case 
of uniform correlation parameter (pi — p for all i) . 

In section |21 of this paper we have used the developed 
formalism to study mixed populations of 'speculators' and 
'producers'. The dynamical approach leads to order pa- 
rameter equations which are identical to those obtained 
previously from replica analyses of such models, and ac- 
cordingly the phase diagrams obtained from the statics 
and dynamics coincide. In general one finds that adding 
producers to the MG increases the range of the ergodic 
phase of the game. 

The main focus of our study, however, has been the fur- 
ther analysis of the dynamical effective single-agent prob- 
lem for the cases of batch and on-line games with unimodal 



8 Note, however, that the conventions used in |5] are slightly 
different, there an ensemble of N speculators and pN producers 
is considered (to make is (1 + p)N agents in total), and a is 
defined as the ratio between the number of patterns P and the 
number of speculators, a — P/N. 
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Fig. 12. Volatility of the game with randomized, asynchronous 
strategy updating (allow for strategy switches with probability 
1/M at each on-line step, independently for different agents). 
Upper panel: uncorrelated strategies (p = 1/2), lower panel: 
anti-correlated strategies (p = 0). All data is from simulations 
at P = 50, the top most curves (circles) are for M = 1 (corre- 
sponding to the on-line game), squares M = 0.1P, diamonds 
M — P, triangles M = 10P. Simulations are started from 
tabula rasa initial conditions and run for 200M on-line steps 
(2000A/ steps for M = 0.1P), all data is averaged over at least 
10 realisations of the disorder. The horizontal lines mark the 
random trading limit, a 2 = 1, in both panels |33|. 



distributions of the strategy correlations, along with nu- 
merical simulations and complementing the analysis of the 
statics of such games previously presented in [2]. 

We find that the model with uniform, but general cor- 
relation parameter p exhibits intriguing features, with sim- 
ilarities, as well as crucial differences, compared to the 
game with uncorrelated strategies (p = 1/2). 

The main common features of the games with gen- 
eral correlation parameter are (a) the existence of two 
distinct phases in both the on-line and batch games for all 
< p < 1, with an ergodic state for a > a c (p), in which 
no dependence on initial conditions is found, and a non- 
ergodic phase below a c (p), in which the nature of the sta- 
tionary state depends on the configuration from which the 
dynamics is started; (b) persistent oscillations are present 
in the non-ergodic phases of both the batch and the on-line 
games for all p. Above a c (p), oscillations are absent in on- 
line games with arbitrary correlation parameter < p < 1, 
and in batch games as long as p > 0; (c) for < p < 1 
the transition point between the two phases is marked by 
a minimum of the volatility a 2 = a 2 (a) in both the batch 
and on-line games; (d) we find analytically that the persis- 
tent order parameters in the stationary ergodic state and 
the phase diagram do not depend on the details of the 
update rules (batch versus on-line learning) and that they 
agree with those calculated within the replica symmetric 
approximation; (e) no oscillations are found in simulations 
of games with random asynchronous updating (for neither 
uncorrelated nor anti-correlated strategies), while at the 




a/(1-x) 



Fig. 13. Phase diagram of the batch MG with a mixed 
population of speculators and producers. The line in the 
(a/(l — x), 1/[(1 — x)(l — p)])-plane is given by 1521 . The cross 
marks the location of the phase transition in the standard game 
(g = 0,p = 0.5). 



same time a reduction of the volatility in the non-ergodic 
phases of such games is observed. 

However, the study of games with differently correla- 
ted strategies and different timings of adaptation also re- 
veals some new features and striking differences between 
on-line and batch games, which up to now have not been 
discussed systematically in the literature. These new is- 
sues may be summarised as follows: (i) the dynamics of 
the batch game with fully anti-correlated strategies, p = 0, 
appears different from the batch games with < p < 1 
and from the on-line game with p = 0, as in the batch 
game at p = oscillations of the form C(r) = (— l) r are 
found for all a and not only in the low-a phase; for tab- 
ula rasa initial conditions the volatility a 2 of the batch 
game is a smooth function of a without any minima or 
turning points. Nevertheless we find that the game with 
fully anti-correlated strategies is in a non-ergodic state 
for a < a c (p = 0) = 1 for both batch and on-line learn- 
ing rules, with the usual dependence of macroscopic order 
parameters on initial conditions in this regime; (ii) the 
volatilities in batch and on-line games deviate increasingly 
from each other as the correlation parameter p is lowered; 
by allowing the agents to update their strategy preferences 
synchronously only once every M steps it is possible to in- 
terpolate smoothly between the on-line and batch limits, 
M = 1 and M > Q(aN), respectively; (iii) the available 
approximations for the volatilities of batch MGs, neglect- 
ing the retarded self- interaction of fickle agents, become 
unreliable in this regime of anti-correlated strategy assig- 
ments. Thus, care has to be taken whenever these approxi- 
mations are applied to other extensions of the conventional 
batch MG. The corresponding approximations in the on- 
line case, based on discarding transient contributions to 
the response and correlation functions, however, appear 
to be valid above a c (p) for all values of p, even in the case 
of full anti-correlation. 
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